A Study of Generic Models for Unsupervised On-line Speaker Indexing

نویسندگان

  • Soonil Kwon
  • Shrikanth Narayanan
چکیده

On-line speaker indexing sequentially detects the points where a speaker identity changes in a multi-speaker audio stream, and classifies each speaker segment. This paper addresses two challenges: The first relates to monitoring which requires on-line processing. The second relates to the fact that the numberlidentity of the speakers is unknown. The indexing needs to be made in a unsupervised process. To address these issues, we apply a predetermined generic speaker-independent model set, Sample Speaker Model(SSM). This set can be useful for more accurate speaker modeling and clustering without requiring training models on target speaker data. Once a speakerindependent model is selected from the sample models, it is adapted into a speaker-dependent model progressively. Experiments were performed with Speaker Recognition Benchmark NIST Speech(1999). Results showed that our new technique, simulated using Markov Chain Monte Carlo Method, gave 92.47% indexing accuracy on telephone conversation data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On-line Speaker Indexing Project Role in Support of Imsc Strategic Plan Discussion of Methodology Used

Unsupervised speaker indexing sequentially detects points where a speaker identity changes in a multi-speaker audio stream, and categorizes each speaker segment, without any prior knowledge about the speakers. This project addresses two challenges: The first relates to sequential speaker change detection. The second relates to speaker modeling in light of the fact that the number/identity of th...

متن کامل

Iterative Unsupervised GMM Training for Speaker Indexing

The paper addresses a novel algorithm for speaker searching and indexation based on unsupervised GMM training. The proposed method doesn’t require a predefined set of generic background models, and the GMM speaker models are trained only from test samples. The constrain of the method is that the number of the speakers has to be known in advance. The results of initial experiments show that the ...

متن کامل

Speaker model quantization for unsupervised speaker indexing

Speaker indexing sequentially detects points where speaker identity changes in a multi-speaker audio stream, and classifies each detected segment according to the speaker’s identity. In unsupervised speaker indexing scenarios, there is no prior information/data about the speakers in the target data. To address this issue, a predetermined generic “speaker-independent” model set, called Sample Sp...

متن کامل

Unsupervised speaker indexing using anchor models and automatic transcription of discussions

We present unsupervised speaker indexing combined with automatic speech recognition (ASR) for speech archives such as discussions. Our proposed indexing method is based on anchor models, by which we define a feature vector based on the similarity with speakers of a large scale speech database. Several techniques are introduced to improve discriminant ability. ASR is performed using the results ...

متن کامل

A method for on-line speaker indexing using generic reference models

On-line Speaker indexing is useful for multimedia applications such as meeting or teleconference archiving and browsing. It sequentially detects the points where a speaker identity changes in a multi-speaker audio stream, and classifies each speaker segment. The main problem of on-line processing is that we can use only current and previous information in the data stream for any decisioning. To...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004